Metrology applied to performance analysis using hardware counters
نویسندگان
چکیده
Performance evaluation and analysis are used to understand and improve computer system performance. One common approach is to measure performance indicators of interest during execution of a set of benchmark programs on a given system. Processor architecture, compiler, operating system, etc. and their interaction are often not predictable and can introduce random variability in such measurements. For instance, complex out-of-order processors with speculation may have a different memory utilization, instructions per cycle, pipeline stalls, branch prediction results, for each execution of a program. This raises the question of accuracy in collecting execution-based measurements. Modern microprocessors contain hardware performance counters to make such measurements and to collect dynamic information on the state of a processor [13], [15]. A common practice is to collect multiple measurements per counter and to report the average value and the standard deviation. However, only a limited number of hardware counters can be collect in one execution of a program. Therefore, the number of times a benchmark suite needs to be executed to collect all the counters of interest may be quite large. The question then is, what is the minimum number of samples required required to measure all parameters with a required accuracy? Another question is if measurements performed in separate benchmark executions can be combined. Finally, there is an issue of how to deal with measurement errors in systems that can collect only a small number of counters per run. The are the issues addressed in this work. We propose a methodology for hardware counter based measurements that answers the above
منابع مشابه
Obtaining Hardware Performance Metrics for the BlueGene/L Supercomputer
Hardware performance monitoring is the basis of modern performance analysis tools for application optimization. We are interested in providing such performance analysis tools for the new BlueGene/L supercomputer as early as possible, so that applications can be tuned for that machine. We are faced with two challenges in achieving that goal. First, the machine is still going through its final de...
متن کاملOptimization and Paralllelization Experiences Using Hardware Performance Counters
Current hardware for compute intensive tasks includes a large amount of processing facilities which is sometimes hard to use in an optimized way. High performance computing (HPC) is always focused in solving grand challenge (or, at least, compute intensive) problems for which the response time is the priority. We have been working from two different but usually complementary research problems: ...
متن کاملPC Desktop Performance and Hardware Performance Counters
The paper begins by describing the evolving architecture of desktop PCs and the performance metrics of interest in evaluating them. It then describes some of the more interesting counters, references an earlier study using the counters, talks a little bit about the software to enable use of these counters, and describes typical values of instructions per cycle and other metrics of interest runn...
متن کاملA Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware Performance Counters
The purpose of the PAPI project is to specify a standard API for accessing hardware performance counters available on most modern microprocessors. These counters exist as a small set of registers that count "events", which are occurrences of specific signals and states related to the processor's function. Monitoring these events facilitates correlation between the structure of source/object cod...
متن کاملArchitecting Cpi Component Performance Counters
......A key application of user-visible hardware performance counters is to give the software developer clear, accurate, and useful hardware-related performance information. This information provides guidance on software changes that can improve performance. An intuitively appealing way to represent the major performance components is to quantify their contributions to the average cycles per in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010